STIC Biotechnology Syst ems Branch 

RAW SEQUENCE LISTING ^ 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: /o/^'^ / 

Source: ' ^ i — r? ^= 

Date Processed by STIC: ^ X~ //- ^ Z d^S'^ 

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.2.2 PROGRAM. ACCESSIBLE THROUGH THE U S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/ofnces/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware tiuit there is 

a possibiUty Uiat Uie disk/CD-Roni may have been affected by treatment given to all incoming mail. 

Please consider using alternate meUiods of submission for Uie disk/CD-Rom or replacement disk/CD-Rom. 

Anv reply inchiding a sequence li-sting in electronic form shoul d NOT be sent to the 2023 1 zip code address for tiic 

United States Patent and Trademark Office, and instead should be sent v ia the following to Uie mdicated addresses: 

I. FFS-Rin Khttp://www.usDto.gov/ebc/efs/downloads/docum ents.htm> , EPS Submission 

User Manual - ePAVE) 

2 U.S. Postal Sei-vice: Conmiissioner for Patents, P.O. Bo-x 1450, Alexandria, VA 22.1 13-1430 

3 Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 
U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building. 40 1 Dulany Street. 
Alexandria. VA 22314 



Revised 01/24/05 




Raw Sequence Listing Error Summary 



KRROR DETECTED SUGGESTED CORRECTION SERIAL NUMBER: 

ATTN: NEW RULES CASES; PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



1 Wrapped Nucleics 

Wrapped Aminos 



!2 



The number/text at the end of each line "wrapped" down to the next line. Phis may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
prevent "wrapping." 



Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



3 Misaligned Amino 

Numbering 

4 Non-ASCII 



The numbering under each 5'^ amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII text. 



_Variable Length Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having vari-able length and indicate in the <220>-<223> section that some may be missing. 

Patentin 2.0 A "bug" in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

"bug" sequences(s) . Normally, Patentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



7 Skipped Sequences Sequence(s) 

(OLD RULES) 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 




Skipped Sequences 
(NEW RULES) 



Use of n's or Xaa's 
(NEW RULES) 



Invalid <213> 
Response 



Sequence(s) _ 



missing. If intentional, please insert the following lines for each skipped sequence. 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1 .823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <2I3> responses are: Unknown. Artificial Sec|uence. or^ 
scientific name ( Genus/species). <2 20>-<22 3> when <-3 l3. > renpnnrte i rt Ij fttawwn or 



Useof<220> 



Sequence(s) 



missing the <220> "Feature" and associated numeric identifiers and responses. 



_Patentln 2.0 
"bug'' 



Use of <220> to <223> is MANDATORY if <2I3> "Organism" response is "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentin version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



1 3 Misuse of n/Xaa "n" can only represent a single nucleotide : "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 
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RAW SEQUENCE LISTING DATE: 07/11/2005 

PATENT APPLICATION: US/10/540,615 TIME: 15:15:15 

Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\07112005\J540615.raw 

4 <110> APPLICANT: CENTER FOR GENETIC ENGINEERING AND BIOTECHNOLOGY 

6 <120> TITLE OF INVENTION: RECOMBINANT HEPATITIS A VIRUS ANTIGENS PRODUCED IN PLANT 

CELLS . 

9 <13 0> FILE REFERENCE: ORF . 
C--> 11 <140> CURRENT APPLICATION NUMBER: US/10/540,615 
C--> 12 <141> CURRENT FILING DATE: 2005-06-23 

14 <160> NUMBER OF SEQ ID NOS : 24 
16 <170> SOFTWARE: Patentin Ver. 2.1 



ERRORED SEQUENCES 



18 <210> SEQ ID NO: 

19 <211> LENGTH.i-7.a6^ 1^ K 1 

20 <2i2> type'.Q^dnJ^ C®G^r©eti®d Bl 

21 <213> ORGANIS^T: fchimferic SequenceMjj-^ . ^ 
24 <22 0> FEATURE: / ^ ^ i. I 



N ^0 

25 <221> NAME/KEY: primer bind // >, ^ ''J ^ . J/5 ;jt4- i W Cii^^ 





26 <222> LOCATION: (1) . . (25) 

27 <223> OTHER INFORMATION: Sequence .. . , . ^ ^ . , 

28 Sequence of the oligonocleotide # 1 used for the amplification ox ORF 
2 9 coding sequence by RT-PCR 

32 <400> SEQUENCE: 1 

33 cttaatctag aatgaatatg tcc^aa 25 

36 <210> SEQ ID NO: 2 

37 <211> LENGT 

38 <212> TYPE 
39 
41 

42 <221> NAME/KEY: primer_bind 

43 <222> LOCATION: (1) . , (22) 

44 <223> OTHER INFORMATION: Sequence # 2. 

45 Sequence of the oligonocleotide # 2 used for the amplification of ORF 

46 coding sequence by RT-PCR 

47 <400> SEQUENCE: 2 

48 gaaagaaata aaggtacctc ag QJ^ 22 



JTH<-^ Q 

<213> ORGANI^^^^^imeHF'"s^ 
<220> FEATURE: • ^Y^''^ 



5' 




51 <210> SEQ ID NO: 3 

52 <211> LENGTH^^.,.^^5 
E--> 53 <212> TYPEc^AeSi 

54 <213> ORGANrSM-r^ Hepatitis A virus 

56 <220> FEATURE: 

57 <221> NAME/KEY: gene 

58 <222> LOCATION: Complement ( (1) . . (6685) ) 

59 <223> OTHER INFORMATION: Sequence # 3. 
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RAW SEQUENCE LISTING DATE: 07/11/2005 

PATENT APPLICATION: US/10/540,615 TIME: 15:15:15 

Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\07112005\J540615 .raw 

60 Nucleotide sequence coding for the HAV open reading frame (ORF) of the 

61 Cuban M2 strain. 

64 <400> SEQUENCE: 3 

65 atgaatatgt ccaaacaagg aattttccag actgttggga gtggccttga ccacatcctg 60 

66 tccttggcag atattgagga agagcaaatg attcagtccg ttgataggac tgcagtgact 12 0 

67 ggagcttctt atttcacttc tgtggaccaa tcttcagttc atactgctga ggttggctca 180 

68 caccaaattg aacctttgaa aacctctgtt gataaacctg gttctaagaa aactcagggg 240 

69 gagaagtttt tcttgattca ttctgctgat tggctcacta cacatgctct ctttcatgaa 300 

70 gttgcaaaat tggatgtggt gaaactgctg tacaatgagc agtttgccgt ccaaggtttg 360 

71 ttgagatacc atacttatgc aagatttggc attgagattc aagttcagat aaatcccaca 420 

72 ccctttcagc aaggaggact aatctgtgcc atggttcctg gtgaccaaag ttatggttca 480 

73 atagcatcct tgactgttta tcctcatggt ctgttaaatt gcaatatcaa caatgtagtt 540 

74 agaataaagg ttccatttat ttatactaga ggtgcttatc attttaaaga tccacagtac 600 

75 ccagtttggg aattgacaat cagagtttgg tcagagttga atattggaac aggaacctca 660 

76 gcttatactt cactcaatgt tttagctagg tttacagatt tggagttgca tggattaact 720 

77 cctctttcta cacagatgat gagaaatgaa tttagagtta gtactactga aaatgttgta 780 

78 aatttgtcaa attatgaaga tgcaagggca aaaatgtctt ttgctttgga tcaggaagat 840 

79 tggaagtctg atccttccca aggtggtgga attaaaatta ctcatttcac tacctggaca 900 

80 tccattccaa ccttagctgc tcagtttcca ttcaatgctt cagattcagt tgggcaacaa 960 

81 attaaagtta taccagtgga cccatacttt ttccagatga caaacactaa tcctgatcaa 1020 

82 aaatgtataa cagccttggc ctctatttgt cagatgttct gcttttggag gggagatctt 1080 

83 gttttcgatt tccaggtttt tccaaccaaa tatcattcag gtaggctgtt gttttgtttt 1140 

84 gttcctggga atgagttaat agatgttact ggaattacat taaaacaggc aactactgct 1200 

85 ccttgtgcag tgatggacat tacaggagtg cagtcaacct tgagatttcg tgttccttgg 1260 

86 atttctgata caccctatcg agtgaatagg tacacgaagt cagcacatca aaaaggtgag 1320 

87 tatactgcca ttgggaagct tattgtgtat tgttataata gattgacttc tccttctaat 1380 

88 gttgcttctc atgttagagt taatgtttat ctttcagcaa ttaatttgga atgttttgct 1440 

89 cctctttacc atgctatgga tgttaccaca caggttggag atgattcagg aggtttctca 1500 

90 acaacagttt ctacagagca gaatgttcct gatccccaag ttggcataac aaccatgagg 1560 

91 gatttaaaag ggaaagccaa taggggaaag atggatgtat caggagtgca ggtacctgtg 1620 

92 ggagctatta caacaattga ggatccagtt ttagcaaaga aagtacctga gacatttcct 1680 

93 gaattgaagc ctggagaatc cagacataca tcagatcaca tgtctattta taaattcatg 1740 

94 ggaaggtctc atttcttgtg tacttttact tttaattcaa acaataaaga gtacacattt 1800 

95 ccaataactc tgtcttcgac ttctaatcct cctcatggtt taccatcaac attaaggtgg 1860 

96 ttctttaatt tgtttcagtt gtatagagga ccattggatt tgacaattat aatcacagga 1920 

97 gccactgatg tggatggtat ggcctggttt actccagtgg gccttgctgt cgacacccct 1980 

98 tgggtggaaa agaagtcagc tttgtctatt gattataaaa ctgcccttgg agctgttaga 2040 

99 tttaatacaa gaagaacagg gaacattcag attagattgc catggtattc ttatttgtat 2100 

100 gccgtgtctg gagcactgga tggcttggga gataagacag attctacatt tggattggtt 2160 

101 tctattcaga ttgcaaatta caatcattct gatgaatatt tgtcctttag ttgttatttg 2220 

102 tctgtcacag agcaatcaga gttctatttc cctagagctc cattaaattc aaatgctatg 2280 

103 ttgtccactg agtccatgat gagtagaatt gcagctggag acttggagtc atcagtggat 2 340 

104 gatcccagat cagaggagga cagaagattt gagagtcata tagaatgtag gaaaccatat 2400 

105 aaagaattga gactggaggt tgggaaacaa agaatcaaat atgctcagga agagttatca 2460 

106 aatgaagtgc ttccacctcc taggaaaatg aaggggttat tttcacaagc taaaatttct 2520 

107 cttttttata cagaggacca tgaaataatg aaattttctt ggagaggagt gactgctaat 2580 

108 actagggctt tgagaagatt tggattctct ctggctgctg gtagaagtgt gtggactctt 2640 

109 gaaatggatg ctggagttct tactggaaga ttgatcagat tgaatgatga gaaatggaca 2700 

110 gaaatgaagg atgataagat tgtttcatta attgaaaagt tcacaagcaa taaacattgg 2760 
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RAW SEQUENCE LISTING DATE: 07/11/2005 

PATENT APPLICATION: US/10/540,615 TIME: 15:15:15 



Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\07112005\J540615.raw 

111 tctaaagtga attttccaca tggaatgttg gatcttgagg aaattgctgc caactctaaa 2820 

112 gattttccaa atatgtctga gacagatttg tgtttcctgt tgcattggct aaatccaaag 2880 

113 aaaattaatt tagcagatag aatgcttgga ttgtctggag tgcaggaaat taaagaacag 2 940 

114 ggtgttggac tgatagcaga gtgtagaact ttcttggatt ctattgctgg gactttgaaa 3000 

115 tctatgattt ttgggtttca ttattctgtg actgttgaaa ttataaatat tgtgctttgt 3060 

116 tttattaaga gtggaatcct gctttatgtc atacaacaat tgaaccaaga tgaacactct 3120 

117 cacataattg gtttgttgag agttatgaat tatgcagata ttggctgttc agtcatttca 3180 

118 tgtggtaaag ttttttccaa aatgttagaa acagttttta attggcaaat ggactctaga 3240 

119 atgatggagc tgaggactca gagcttctcc aattggttaa gagatatttg ttcgggaatt 3300 

120 actattttta aaagttttaa ggatgccata tattggttat gtacaaaatt gaaggatttt 3360 

121 tatgaagtaa attatggcaa gaaaaaggat gttcttaata ttctcaaaga taaccagcaa 3420 

122 aaaatagaaa aagccattga agaagcagac aatttttgca ttttgcaaat tcaagatgtg 3480 

123 gagaaatttg atcagtatca gaaaggggtt gatttaatac aaaagctgag aactgtccat 3540 

124 tcaatggctc aagttgaccc cagtttgggg gttcatttgt cacctctcag agattgcata 36 00 

125 gcaagagtcc atcaaaagct caagaatctt ggatctataa atcaggccat ggtaacaaga 3660 

126 tgtgagccag ttgtttgcta tttgtatggc aaaagagggg gagggaaaag cttgacttca 372 0 

127 attgcattgg caaccaaaat ttgtaaacac tatggtgttg aacctgagaa aaatatttac 3780 

128 accaaacctg tggcctcaga ttattgggat ggatatagtg gacaattagt ttgtattatt 3840 
12 9 gatgatatcg gccaaaacac aacagatgaa gattggtcag atttttgtca attagtgtca 3 900 

130 ggatgcccaa tgagattgaa tatggcttct cttgaggaga agggcagaca tttttcctct 3960 

131 ccttttataa tagcatcttc aaattggtca aatccaagtc caaaaacagt ttatgttaaa 4020 

132 gaagcaattg atcgtaggct tcattttaag gttgaagtta aacctgcttc attttttaaa 4080 

133 aatcctcaca atgatatgtt aaatgttaat ttggctaaaa caaatgatgc aattaaagac 4140 

134 atgtcttgtg ttgatttgat aatggatgga cacaatattt cattgatgga tttacttagt 4200 

135 tccttagtga tgacaggtga aattaggaaa cagaatatga gtgaattcat ggagttgtgg 4260 

136 tctcagggaa tttcagatga tgacaatgat agtgcagtag ctgagttttt ccggtctttt 4320 

137 ccatctggtg aaccatcaaa ttccaagtta tctagttttt tccaagctgt cactaatcac 4380 

138 aagtgggttg ctgtgggagc tgcagttggt attcttggat tgctagtggg aggatggttt 4440 

139 gtgtataagc atttttcccg caaagaggaa gaaccaattc cagctgaagg ggtttatcat 4500 

140 ggagtgacta agcccaaaca agtgattaaa ttggatgcag atccagtaga gtctcagtca 4560 

141 actctagaaa tagcaggatt agttaggaaa aatttggttc agtttggagt tggtgagaaa 4620 

142 aatggatgtg tgagatgggt catgaatgcc ttaggagtga aggatgattg gttgttagta 4680 

143 ccttctcatg cttataaatt tgaaaaggat tatgaaatga tggagtttta tttcaataga 4740 

144 ggtggaactt actattcaat ttcagctggt aatgttgtta ttcaatcttt agatgtggga 4800 

145 ttccaagatg ttgttctaat gaaggttcct acaattccca agtttagaga tattactcaa 4860 

146 cattttatta agaaaggaga tgtgcctaga gccttgaatc gcttggcaac attagtgaca 4920 
14 7 accgttaatg gaactcctat gttaatttct gagggacctt taaaaatgga agaaaaagcc 4 980 

148 acttatgttc ataagaagaa tgatggtact acggttgatt tgactgtaga tcaggcatgg 5040 

149 agaggaaaag gtgaaggtct tcctggaatg tgtggtgggg ccctagtgtc atcaaatcag 5100 

150 tccatacaaa atgcaatttt gggtattcat gttgctggag gaaattcaat tcttgtggca 5160 

151 aagttgatta ctcaagaaat gtttcaaaac attgataaga aaattgaaag tcagagaata 5220 

152 atgaaagtgg aatttactca atgttcaatg aatgtagtct ccaaaacgct ttttagaaag 5280 

153 agtcccattc atcaccacat tgataaaacc atgattaatt ttcctgcagc tatgcctttc 5340 

154 tctaaagctg aaattgatcc aatggctatg atgttgtcta aatattcatt acctattgtg 5400 

155 gaagaaccag aggattacaa agaagcttca gttttttatc aaaataaaat agtaggcaag 5460 

156 actcagctag ttgatgactt tctagatctt gatatggcca ttacaggggc tccaggcatt 552 0 

157 gatgctatta atatggattc atctcctggg tttccttatg ttcaagaaaa attgactaaa 5580 

158 agagatttga tttggttgga tgaaaatggt ttactgttag gagttcaccc aagattggcc 5640 

159 cagagaatct tatttaatac tgtcatgatg gaaaattgtt ctgacttaga tgttgttttt 5700 
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RAW SEQUENCE LISTING DATE: 07/11/2005 

PATENT APPLICATION: US/10/540,615 TIME: 15:15:15 

Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\07112005\J540615.raw 

160 acaacttgtc caaaagatga attgagacca ttagagaaag ttttggaatc aaaaacaaga 5760 

161 gctattgatg cttgcccttt ggattataca attttatgtc gaatgtattg gggtccagct 5820 

162 attagttatt ttcatttgaa tccagggttt cacacaggtg ttgctattgg catagatcct 5880 

163 gatagacagt gggatgaatt atttaaaaca atgataagat ttggagatgt tggtcttgat 5940 

164 ttagattttt ctgcttttga tgccagtctt agtccattta tgattaggga agcaggtaga 6000 

165 atcatgagtg aattatctgg aacaccatct cattttggaa cagctcttat caatactatc 6060 

166 atttattcta aacatctgct gtacaattgt tgttatcacg tctgtggttc aatgccttct 6120 

167 gggtctcctt gtacagcttt gttgaattca attattaata atattaattt gtattatgtg 6180 

168 ttttctaaaa tatttggaaa gtctccagtt ttcttttgtc aagctttgag gatcctttgt 6240 

169 tatggagatg atgttttgat agttttttcc agagatgttc aaattgataa tcttgacttg 6300 

170 attggacaga aaattgtgga tgagttcaaa aaacttggca tgacagccac ttcagctgac 6360 

171 aaaaatgtgc ctcaactgaa gccagtttca gaattgactt ttcttaaaag atcttttaat 6420 

172 ttggtggagg 'acagaatcag acctgcaatt tcagaaaaga caatttggtc tttgatagct 6480 

173 tggcagagaa gtaacgctga gtttgagcag aatttagaaa atgctcagtg gtttgctttc 6540 

174 atgcatggct atgagttcta tcagaaattc tattattttg ttcagtcctg tttggagaaa 6600 

175 gagatgatag aatatagact taaatcttat gattggtgga gaatgagatt ttatgaccag 6660 

176 tgtttcattt gtgacctttc atgat / 1 ■ 6685 

179 <210> SEQ ID NO: 4 >pi ^ \J JCJ^^ Mf^'f 

180 <2ii> LENGTH^^-p^PN^; r l ^ ' r) /Xy^^^'^'^^^ 

E--> 181 <212> TYPEi^Dr *^ „VC/^ J 

182 <213> ORGANiSMTf Chimeric 
184 <220> FEATURE: 
. 185 <221> NAME/KEY: prTmerT>ind 

186 <222> LOCATION: (1) . . (40) 

187 <223> OTHER INFORMATION: Sequence # 4. 

188 Sequence of the oligonocleotide # 5 used for the 

189 amplification of P1-2A coding sequence by PCR. 

192 <400> SEQUENCE: 4 

193 ttgaattcag cttgtgaaaa taaccccttc attttcctag (l4v/^ 




196 <210> SEQ ID NO: 5 ^^^lA*^ 




197 <211> LENGTHj 
E--> 198 <212> TYPEi 

199 <213> ORGAlftS^j^^himeric Sequence 

201 <220> FEATURE: 

202 <221> NAME/KEY: primer__bind~ 

203 <222> LOCATION: (1) . . (28) 

204 <223> OTHER INFORMATION: Sequence # 5. 

206 Sequence of the oligonocleotide .# 5 used for the 

207 amplification of P1-2A coding sequence by PCR. 

209 <400> SEQUENCE: 5 

210 cgcccgggtc tagaatgaat atgtccaa ^ ^ 28 

213 <210> SEQ ID NO: 6 

214 <211> LENGTH^.jj,,2J&2,3^ 
E--> 215 <212> TYPE^^N 

216 <213> ORGANtOTT^Hepatitis A virus 

218 <220> FEATURE: 

219 <22i> NAME/KEY: gene 

220 <222> LOCATION: Complement ( (1) . . (2523) ) 

221 <223> OTHER INFORMATION: Sequence # 6. 



:aa 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/540,615 



DATE: 07/11/2005 
TIME: 15:15:15 



E- 



222 
223 
226 
227 
228 
229 
230 
231 
232 
233 
234 
235 
236 
237 
238 
239 
240 
241 
242 
243 
244 
245 
246 
247 
248 
249 
250 
251 
252 
253 
254 
255 
256 
257 
258 
259 
260 
261 
262 
263 
264 
265 
266 
267 
268 
269 
272 
273 
274 



Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\07112005\J540615.raw 

Nucleotide sequence coding for the structural 
P1-2A HAV proteins of the M2 strain. 
<400> SEQUENCE: 6 

atgaatatgt ccaaacaagg aattttccag actgttggga gtggccttga 
tccttggcag atattgagga agagcaaatg attcagtccg ttgataggac 
ggagcttctt atttcacttc tgtggaccaa tcttcagttc atactgctga 
aacctttgaa aacctctgtt gataaacctg gttctaagaa 
tcttgattca ttctgctgat tggctcacta cacatgctct 
tggatgtggt gaaactgctg tacaatgagc agtttgccgt 
atacttatgc aagatttggc attgagattc aagttcagat 
aaggaggact aatctgtgcc atggttcctg gtgaccaaag 
tgactgttta tcctcatggt 
ttccatttat ttatactaga 
aattgacaat cagagtttgg 
cactcaatgt tttagctagg 
cacagatgat gagaaatgaa 
attatgaaga tgcaagggca 
atccttccca aggtggtgga 
ccttagctgc tcagtttcca 



caccaaattg 
gagaagtttt 
gttgcaaaat 
ttgagatacc 
ccctttcagc 
atagcatcct 
agaataaagg 
ccagtttggg 
gcttatactt 
cctctttcta 
aatttgtcaa 
tggaagtctg 
tccattccaa 
attaaagtta 
aaatgtataa 
gttttcgatt 
gttcctggga 
ccttgtgcag 
atttctgata 
tatactgcca 
gttgcttctc 
cctctttacc 
acaacagttt 
gatttaaaag 
ggagctatta 
gaattgaagc 
ggaaggtctc 
ccaataactc 
ttctttaatt 
gccactgatg 
tgggtggaaa 
tttaatacaa 
gccgtgtctg 
tctattcaga 
tctgtcacag 
ttgtccactg 
gatcccagat 



ctgttaaatt gcaatatcaa 

ggtgcttatc attttaaaga 



taccagtgga cccatacttt 
cagccttggc ctctatttgt 



tcagagttga 
tttacagatt 
tttagagtta 
aaaatgtctt 
attaaaatta 
ttcaatgctt 
ttccagatga 
cagatgttct 



atattggaac 
tggagttgca 
gtactactga 
ttgctttgga 
ctcatttcac 
cagattcagt 
caaacactaa 
gcttttggag 



tccaggtttt 
atgagttaat 
tgatggacat 
caccctatcg 
ttgggaagct 
atgttagagt 
atgctatgga 



tccaaccaaa 
agatgttact 
tacaggagtg 
agtgaatagg 
tattgtgtat 
taatgtttat 
tgttaccaca 



ctacagagca gaatgttcct 
ggaaagccaa taggggaaag 
caacaattga ggatccagtt 
ctggagaatc cagacataca 
atttcttgtg tacttttact 
tgtcttcgac ttctaatcct 
tgtttcagtt gtatagagga 
tggatggtat ggcctggttt 
agaagtcagc tttgtctatt 
gaagaacagg gaacattcag 



tatcattcag gtaggctgtt 
ggaattacat taaaacaggc 
cagtcaacct tgagatttcg 
tacacgaagt cagcacatca 
tgttataata gattgacttc 
ctttcagcaa ttaatttgga 
caggttggag atgattcagg 
gatccccaag ttggcataac 
atggatgtat caggagtgca 
ttagcaaaga aagtacctga 
tcagatcaca tgtctattta 
tttaattcaa acaataaaga 
cctcatggtt taccatcaac 
ccattggatt tgacaattat 
actccagtgg gccttgctgt 
gattataaaa ctgcccttgg 
attagattgc catggtattc 



gagcactgga tggcttggga gataagacag attctacatt 



ttgcaaatta caatcattct 
agcaatcaga gttctatttc 
agtccatgat gagtagaatt 
cagaggagga cagaagattt 
aaagaattga gactggaggt tgggaaacaa 
aatgaagtgc ttccacctcc taggaaaatg 
gat 

<210> SEQ ID NO: 7 
<211> LENGTH: 27 
<212> TYPE t^^^^yi 



gatgaatatt tgtcctttag 
cctagagctc cattaaattc 
gcagctggag acttggagtc 
gagagtcata tagaatgtag 
agaatcaaat atgctcagga 
aaggggttat atgcttctgg 



ccacatcctg 

tgcagtgact 
ggttggctca 
aactcagggg 
ctttcatgaa 
ccaaggtttg 
aaatcccaca 
ttatggttca 
caatgtagtt 
tccacagtac 
aggaacctca 
tggattaact 
aaatgttgta 
tcaggaagat 
tacctggaca 
tgggcaacaa 
tcctgatcaa 
gggagatctt 
gttttgtttt 
aactactgct 
tgttccttgg 
aaaaggtgag 
tccttctaat 
atgttttgct 
aggtttctca 
aaccatgagg 
ggtacctgtg 
gacatttcct 
taaattcatg 
gtacacattt 
attaaggtgg 
aatcacagga 
cgacacccct 
agctgttaga 
ttatttgtat 
tggattggtt 
ttgttatttg 
aaatgctatg 
atcagtggat 
gaaaccatat 
agagttatca 
aggtgaattc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2523 
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ir M:2 

12 M:2 
20 M:3 
38 M:3 
53 M:3 
181 M: 
198 M: 
;215 M: 
274 M: 
;291 M: 
;308 M: 
:328 M: 
;348 M: 
;365 M: 
;382 M: 
:416 M: 
;492 M: 
:508 M: 
:525 M: 
:535 M: 
:600 M: 
:617 M: 
:633 M: 
:637 M: 
:650 M: 
:654 M: 
;667 M: 
:726 M: 
:759 M: 



70 C: 

71 C: 
10 E: 
10 E: 
10 E: 
310 E 
310 E 
310 E 
310 E 
310 E 
310 E 
310 E 
310 E 
310 E 
310 E 
310 E 
310 E 
310 E 
310 E 
259 W 
310 E 
310 E 
310 E 
257 W 
310 E 
257 W 
310 E 
310 E 
310 E 



VERIFICATION SUMMARY DATE: 07/11/2005 

PATENT APPLICATION: US/10/540,615 TIME: 15:15:16 

Input Set : A:\PTO. DA .txt 

Output Set: N:\CRF4\07112005\J540615.raw 

Current Application Number differs, Replaced Application Number 
Current Filing Date differs. Replaced Current Filing Date 
(3) Wrong or Missing Sequence Type, TYPE: 
(3) Wrong or Missing Sequence Type, TYPE: 



(3) 


Wrong or Missing Sequence Type, 


TYPE: 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 


(3) 


Wrong 


or 


Missing 


Sequence 


Type, 


TYPE 



Allowed number of lines exceeded, <223> Other Information: 

(3) Wrong or Missing Sequence Type, TYPE: 

(3) Wrong or Missing Sequence Type, TYPE: 

(3) Wrong or Missing Sequence Type, TYPE: 
Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:20 

(3) Wrong or Missing Sequence Type, TYPE: 

Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:21 
(3) Wrong or Missing Sequence Type, TYPE: 
(3) Wrong or Missing Sequence Type, TYPE: 
(3) Wrong or Missing Sequence Type, TYPE: 
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RAW SEQUENCE LISTING DATE: 07/11/2005 

PATENT APPLICATION: US/10/540,615 TIME: 15:15:15 

Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\07112005\J540615.raw 

632 <211> LENGTH^_55 ^ ^ ~~^<P^><^u^ ^^XX^^T^ 

E--> 633 <212> TYPEC^ mF^ 

634 <213> ORGANISM: (Chimeric Sequence 

636 <220> FEATURE: 
W--> 637 <221> NAME/KEY: Dsegment 

638 <222> LOCATION: (1) . . (54) 

63 9 <223> OTHER INFORMATION: Sequence #20 

640 synthetic fragment modifying the 3' end of 

641 the 2A protein and introduces a space-bar 

642 between this one and the KDEL signal. 

644 <400> SEQUENCE: 20 

645 cctaggaaaa tgaaggggtt atatgcttct ggaggtgaat tcgatatcaa ggatg 55 



The type of errors shown exist throughout 
the Sequence Listing . Please check subseq uent 
sequences for similar'ferrors". ' 
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RAW SEQUENCE LISTING DATE: 07/11/2005 

PATENT APPLICATION: US/10/540,615 TIME: 15:15:15 



573 
574 
575 
576 
577 
578 
579 
580 
581 
582 
583 
584 
585 
586 
587 
588 
589 
590 
591 
592 
593 
594 
595 
598 
599 

E--> 600 
601 
603 
604 
605 
606 
607 
608 
609 
611 
612 
615 
616 

E--> 617 
618 
620 
621 
622 
623 
624 
625 
627 
628 
631 



Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\07112005\J540615.raw 

attcagattg caaattacaa tcattctgat gaatatttgt cctttagttg ttatttgtct 
gtcacagagc aatcagagtt ctatttccct agagctccat taaattcaaa tgctatgttg 
tccactgagt ccatgatgag tagaattgca gctggagact tggagtcatc agtggatgat 
cccagatcag aggaggacag aagatttgag agtcatatag aatgtaggaa accatataaa 
gaattgagac tggaggttgg gaaacaaaga atcaaatatg ctcaggaaga gttatcaaat 
gaagtgcttc cacctcctag gaaaatgaag gggttatttt cacaagctga attcctgcag 
cccgggggat ccatgggaat ttcagatgat gacaatgata gtgcagtagc tgagtttttc 
cggtcttttc catctggtga accatcaaat tccaagttat ctagtttttt ccaagctgtc 
actaatcaca agtgggttgc tgtgggagct gcagttggta ttcttggatt gctagtggga 
ggatggtttg tgtataagca tttttcccgc aaagaggaag aaccaattcc agctgttggg 
gtttatcatg gagtgactaa gcccaaacaa gtgattaaat tggatgcaga tccagtagag 
tctcagttga ctctagaaat agcaggatta gttaggaaaa atttggttca gtttggagtt 
ggtgagaaaa atggatgtgt gagatgggtc atgaatgcct taggagtgaa ggatgattgg 
ttgttagtac cttctcatgc ttataaattt gaaaaggatt atgaaatgat ggagttttat 
ttcaatagag gtggaactta ctattcaatt tcagctggta atgttgttat tcaatcttta 
gatgtgggat tccaagatgt tgttctaatg aaggttccta caattcccaa gtttagagat 
attactcaac attttattaa gaaaggagat gtgcctagag ccttgaatcg cttggcaaca 
ttagtgacaa ccgttaatgg aactcctatg ttaatttctg agggaccttt aaaaatggaa 
gaaaaagcca cttatgttca taagaagaat gatggtacta cggttgattt gactgtagat 
caggcatgga gaggaaaagg tgaaggtctt cctggaatgt gtggtggggc cctagtgtca 
tcaaatcagt ccatacaaaa tgcaattttg ggtattcatg ttgctggagg aaattcaatt 
cttgtggcaa agttgattac tcaagaaatg tttcaaaaca ttgataagaa aattgaaatc 
aagctt 

<210> SEQ ID NO: 
<211> LENGTH 
<212> TYPeC^^DIi^^^ 
<213> ORGANISM <^himeric Sequence 




<220> FEATURE: 

<221> NAME/KEY: sig_peptide 

<222> LOCATION: (1) . . (19) 

<223> OTHER INFORMATION: Sequence #18. 

Synthetic fragment corresponding to the 
KDEL endoplasmic reticulum retention signal 
sequence . 

<400> SEQUENCE: 18 

atcaaggatg aattgtaat 

<210> SEQ ID NO: 19 

<211> LENGTii:_^2Ll 

<212> TYPB^ADI}^^^ 

<213> ORGANISM: (Chimeric Sequence 

<220> FEATURE: 

<221> NAME/KEY: sigjpeptide 
.<222> LOCATION: (1) , . (21) 

<223> OTHER INFORMATION: Sequence #19. 

Synthetic fragment corresponding to the KDEL 
endoplasmic reticulum retention signal sequence. 

<400> SEQUENCE: 19 

cgattacaat tcatccttga t 

<210> SEQ ID NO: 20 




2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3426 



19 



21 
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RAW SEQUENCE LISTING DATE: 07/11/2005 

PATENT APPLICATION: US/10/540,615 TIME: 15:15:15 

Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\07112005\J540615.raw 



520 gtcctatcaa cggactgaat catttgctct tcctcaatat ccatccc 47 
524 <211> LENGTH: 3426 ^ ^CL'*'^^^ 



523 <210> SEQ ID NO: 17 Qn.^ QJXxA 

E--> 525 <212> TYPE ; ^DN^|^ ^ 

526 <213> ORGANI SM : ^Hepa t i t i s A virus 

528 <220> FEATURE: 

529 <221> NAME/KEY: gene 

530 <222> LOCATION: Complement ( (1) .. (3426) ) 

531 <223> OTHER INFORMATION: Sequence # 17 

532 Sequence coding for the modified open reading 

533 frame (?ORFm) of the Cuban M2 strain of the HAV-. 

534 This sequence does not have the gene coding 
W--> 535 for the VP4 protein. 

537 <400> SEQUENCE: 17 

53 8 gggatggata ttgaggaaga gcaaatgatt cagtccgttg ataggactgc agtgactgga 60 

539 gcttcttatt tcacttctgt ggaccaatct tcagttcata ctgctgaggt tggctcacac 120 

540 caaattgaac ctttgaaaac ctctgttgat aaacctggtt ctaagaaaac tcagggggag 180 

541 aagtttttct tgattcattc tgctgattgg ctcactacac atgctctctt tcatgaagtt 240 

542 gcaaaattgg atgtggtgaa actgctgtac aatgagcagt ttgccgtcca aggtttgttg 300 

543 agataccata cttatgcaag atttggcatt gagattcaag ttcagataaa tcccacaccc 360 

544 tttcagcaag gaggactaat ctgtgccatg gttcctggtg accaaagtta tggttcaata 420 

545 gcatccttga ctgtttatcc tcatggtctg ttaaattgca atatcaacaa tgtagttaga 480 

546 ataaaggttc catttattta tactagaggt gcttatcatt ttaaagatcc acagtaccca 540 

547 gtttgggaat tgacaatcag agtttggtca gagttgaata ttggaacagg aacctcagct 600 

548 tatacttcac tcaatgtttt agctaggttt acagatttgg agttgcatgg attaactcct 660 

549 ctttctacac agatgatgag aaatgaattt agagttagta ctactgaaaa tgttgtaaat 720 

550 ttgtcaaatt atgaagatgc aagggcaaaa atgtcttttg ctttggatca ggaagattgg 780 

551 aagtctgatc cttcccaagg tggtggaatt aaaattactc atttcactac ctggacatcc 84 0 

552 attccaacct tagctgctca gtttccattc aatgcttcag attcagttgg gcaacaaatt 900 

553 aaagttatac cagtggaccc atactttttc cagatgacaa acactaatcc tgatcaaaaa 960 

554 tgtataacag ccttggcctc tatttgtcag atgttctgct tttggagggg agatcttgtt 1020 

555 ttcgatttcc aggtttttcc aaccaaatat cattcaggta ggctgttgtt ttgttttgtt 1080 

556 cctgggaatg agttaataga tgttactgga attacattaa aacaggcaac tactgctcct 1140 

557 tgtgcagtga tggacattac aggagtgcag tcaaccttga gatttcgtgt tccttggatt 1200 

558 tctgatacac cctatcgagt gaataggtac acgaagtcag cacatcaaaa aggtgagtat 1260 

559 actgccattg ggaagcttat tgtgtattgt tataatagat tgacttctcc ttctaatgtt 1320 

560 gcttctcatg ttagagttaa tgtttatctt tcagcaatta atttggaatg ttttgctcct 1380 

561 ctttaccatg ctatggatgt taccacacag gttggagatg attcaggagg tttctcaaca 1440 

562 acagtttcta cagagcagaa tgttcctgat ccccaagttg gcataacaac catgagggat 1500 

563 ttaaaaggga aagccaatag gggaaagatg gatgtatcag gagtgcaggt acctgtggga 1560 

564 gctattacaa caattgagga tccagtttta gcaaagaaag tacctgagac atttcctgaa 1620 

565 ttgaagcctg gagaatccag acatacatca gatcacatgt ctatttataa attcatggga 1680 

566 aggtctcatt tcttgtgtac ttttactttt aattcaaaca ataaagagta cacatttcca 1740 
567- ataactctgt cttcgacttc taatcctcct catggtttac catcaacatt aaggtggttc 1800 

568 tttaatttgt ttcagttgta tagaggacca ttggatttga caattataat cacaggagcc 1860 

569 actgatgtgg atggtatggc ctggtttact ccagtgggcc ttgctgtcga caccccttgg 1920 

570 gtggaaaaga agtcagcttt gtctattgat tataaaactg cccttggagc tgttagattt 1980 

571 aatacaagaa gaacagggaa cattcagatt agattgccat ggtattctta tttgtatgcc 2040 

572 gtgtctggag cactggatgg cttgggagat aagacagatt ctacatttgg attggtttct 2100 
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RAW SEQUENCE LISTING DATE: 07/11/2005 

PATENT APPLICATION: US/10/540,615 TIME: 15:15:15 



Input Set : A:\PTO.DA.txt 

Output Set: N:\CRF4\07112005\J540615.raw 

462 tgggtggaaa agaagtcagc tttgtctatt gattataaaa ctgcccttgg agctgttaga 2040 

463 tttaatacaa gaagaacagg gaacattcag attagattgc catggtattc ttatttgtat 2100 

464 gccgtgtctg gagcactgga tggcttggga gataagacag attctacatt tggattggtt 2160 

465 tctattcaga ttgcaaatta caatcattct gatgaatatt tgtcctttag ttgttatttg 2220 

466 tctgtcacag agcaatcaga gttctatttc cctagagctc cattaaattc aaatgctatg 2280 

467 ttgtccactg agtccatgat gagtagaatt gcagctggag acttggagtc atcagtggat 2340 

468 gatcccagat cagaggagga cagaagattt gagagtcata tagaatgtag gaaaccatat 2400 

469 aaagaattga gactggaggt tgggaaacaa agaatcaaat atgctcagga agagttatca 2460 

470 aatgaagtgc ttccacctcc taggaaaatg aaggggttat tttcacaagc tgaattcctg 2520 

471 cagcccgggg gatccatggg aatttcagat gatgacaatg atagtgcagt agctgagttt 2580 

472 ttccggtctt ttccatctgg tgaaccatca aattccaagt tatctagttt tttccaagct 2640 

473 gtcactaatc acaagtgggt tgctgtggga gctgcagttg gtattcttgg attgctagtg 2700 

474 ggaggatggt ttgtgtataa gcatttttcc cgcaaagagg aagaaccaat tccagctgtt 2760 

475 ggggtttatc atggagtgac taagcccaaa caagtgatta aattggatgc agatccagta 2820 

476 gagtctcagt tgactctaga aatagcagga ttagttagga aaaatttggt tcagtttgga 2880 

477 gttggtgaga aaaatggatg tgtgagatgg gtcatgaatg ccttaggagt gaaggatgat 2940 

478 tggttgttag taccttctca tgcttataaa tttgaaaagg attatgaaat gatggagttt 3000 

479 tatttcaata gaggtggaac ttactattca atttcagctg gtaatgttgt tattcaatct 3060 

480 ttagatgtgg gattccaaga tgttgttcta atgaaggttc ctacaattcc caagtttaga 3120 

481 gatattactc aacattttat taagaaagga gatgtgccta gagccttgaa tcgcttggca 3180 

482 acattagtga caaccgttaa tggaactcct atgttaattt ctgagggacc tttaaaaatg 3240 

483 gaagaaaaag ccacttatgt tcataagaag aatgatggta ctacggttga tttgactgta 3300 

484 gatcaggcat ggagaggaaa aggtgaaggt cttcctggaa tgtgtggtgg ggccctagtg 3360 

485 tcatcaaatc agtccataca aaatgcaatt ttgggtattc atgttgctgg aggaaattca 3420 

486 attcttgtgg caaagttgat tactcaagaa atgtttcaaa acattgataa gaaaattgaa 3480 

487 atcaagctt ^ 3489 

490 <210> SEQ ID NO: 15 .^^JUt/^ 

491 <211> LENGTH^,.--5-3r--:::^ C^-^-^^^ 
E--> 492 <212> TYPE:(adN 

493 <213> ORGAN iSM-TT'^tTimeric Sequence"^^ 




495 <220> FEATURE: 

496 <221> NAME/KEY: 

497 <222> LOCATION: (1) . . (51) 

498 <223> OTHER INFORMATION: Sequence # 15. 

499 Synthetic fragment that reverts the 

500 transcription start of the vp2 protein. 

502 <400> SEQUENCE: 15 

503 gggatggata ttgaggaaga gcaaatgatt cagtccgttg ataggactgc a 51 

506 <210> SEQ ID NO: 16 n 

507 <211> LENGTH^.^ C 'O^Ut^ jMir<> 

E--> 508 <212> TYPEr ADN ^ ^\ a^^in^ 



509 <213> ORGANlBWf'^Xhimeric Sequence^^ 

511 <220> FEATURE: ^ -"""""^^ 

512 <221> NAME/KEY: gene 

513 <222> LOCATION: (1) . . (47) 

514 <223> OTHER INFORMATION: Sequence # 16. 

515 Synthetic fragment that reverts the transcription 

516 start of the vp2 protein (complementary chain) . 
519 <400> SEQUENCE: 16 
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RAW SEQUENCE LISTING DATE: 07/11/2005 

PATENT APPLICATION: US/10/540 , 615 TIME: 15:15:15 

Input Set : A:\PTO.DA.txt {^'^^^ /^\^y^y^ 

Output Set: N:\CRF4\07112005\J540615.raw ^ ^/"tVA,^ 



346 <210> SEQ ID NO 

347 <211> LENOT^*'^ 





E--> 348 <212> TYPB^ADJi 
349 <213> ORGANISM; 

351 <220> FEATURE: 

352 <221> NAME/KEY: p^ 

353 <222> LOCATION: (1) . . (25) 

354 <22 3> OTHER INFORMATION: Sequence # 11. 

355 Sequence of the oligonocleotide # 11 used 

356 for the amplification of 3C coding sequence by PGR 

359 <400> SEQUENCE: 11 

360 tctcagtcaa ctctagaaat agcag 25 

363 <210> SEQ ID NO: 12 

364 <211> LENGTH: 
E--> 365 <212> TYPE: 

366 <213> ORGANISM^— Ohimeric Sequence 

368 <220> FEATURE: 

369 <221> NAME/KEY: prTiTTeT?^^^ied- 

370 <222> LOCATION: (1) . . (21) 

371 <223> OTHER INFORMATION: Sequence # 12. 

372 Sequence of the oligonocleotide # 12 used for 

373 the amplification of 3C coding sequence by PGR 

376 <400> SEQUENCE: 12 

377 ataagcttga tcaattttct t , 21 

380 <210> SEQ ID NO: 13 

381 <211> LENGTHx^re 
E--> 382 <212> TYPE^ADN J 

383 <213> ORGANMM-f-^epatitis A virus 

385 <220> FEATURE: 

386 <221> NAME/KEY: gene 

387 <222> LOCATION: Complement ( (1) . . (978) ) 

388 <223> OTHER INFORMATION: Sequence # 13. 

389 Sequence corresponding to the region of .3ABC 
3 90 polyprotein with proteolytic activity having 
391 the selfprocessing sites mutated. 

3 94 <400> SEQUENCE: 13 

3 95 gaattcctgc agcccggggg atccatggga atttcagatg atgacaatga tagtgcagta 60 
396 gctgagtttt tccggtcttt tccatctggt gaaccatcaa attccaagtt atctagtttt 120 
3 97 ttccaagctg tcactaatca caagtgggtt gctgtgggag ctgcagttgg tattcttgga 18 0 

398 ttgctagtgg gaggatggtt tgtgtataag catttttccc gcaaagagga agaaccaatt 240 

399 ccagctgttg gggtttatca tggagtgact aagcccaaac aagtgattaa attggatgca 300 

400 gatccagtag agtctcagtt gactctagaa atagcaggat tagttaggaa aaatttggtt 360 

401 cagtttggag ttggtgagaa aaatggatgt gtgagatggg tcatgaatgc cttaggagtg 420 

402 aaggatgatt ggttgttagt accttctcat gcttataaat ttgaaaagga ttatgaaatg 480 

403 atggagtttt atttcaatag aggtggaact tactattcaa tttcagctgg taatgttgtt 540 

404 attcaatctt tagatgtggg attccaagat gttgttctaa tgaaggttcc tacaattccc 600 

405 aagtttagag atattactca acattttatt aagaaaggag atgtgcctag agccttgaat 660 

406 cgcttggcaa cattagtgac aaccgttaat ggaactccta tgttaatttc tgagggacct 720 

407 ttaaaaatgg aagaaaaagc cacttatgtt cataagaaga atgatggtac tacggttgat 780 
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408 ttgactgtag atcaggcatg gagaggaaaa ggtgaaggtc ttcctggaat gtgtggtggg 840 

409 gccctagtgt catcaaatca gtccatacaa aatgcaattt tgggtattca tgttgctgga 900 

410 ggaaattcaa ttcttgtggc aaagttgatt actcaagaaa tgtttcaaaa cattgataag 960 

411 aaaattgaaa tcaagctt 978 

414 <210> SEQ ID NO: 14 

415 <211> LENGTR^>489 ,-y£=>f..y.y^Ul^ ^ Y^/X^ ^ . 

E--> 416 <212> TYPErQAOT/ ^^jT^'V^^^^ 

417 <213> ORGANISM: Hepatitis A virus 

419 <220> FEATURE: 

420 <221> NAME/KEY: gene 

421 <222> LOCATION: Complement (( 1 (3489) ) 

422 <223> OTHER INFORMATION: Sequence # 14. 

423 Nucleotide sequence CODING for the new 

424 modified open reading frame (ORFm) of the 
42 5 Cuban M2 strain. 

428 <400> SEQUENCE: 14 

42 9 atgaatatgt ccaaacaagg aattttccag actgttggga gtggccttga ccacatcctg 60 

430 tccttggcag atattgagga agagcaaatg attcagtccg ttgataggac tgcagtgact 120 

431 ggagcttctt atttcacttc tgtggaccaa tcttcagttc atactgctga ggttggctca 180 

432 caccaaattg aacctttgaa aacctctgtt gataaacctg gttctaagaa aactcagggg 240 

433 gagaagtttt tcttgattca ttctgctgat tggctcacta cacatgctct ctttcatgaa 300 

434 gttgcaaaat tggatgtggt gaaactgctg tacaatgagc agtttgccgt ccaaggtttg 360 

435 ttgagatacc atacttatgc aagatttggc attgagattc aagttcagat aaatcccaca 42 0 

436 ccctttcagc aaggaggact aatctgtgcc atggttcctg gtgaccaaag ttatggttca 480 

437 atagcatcct tgactgttta tcctcatggt ctgttaaatt gcaatatcaa caatgtagtt 540 

438 agaataaagg ttccatttat ttatactaga ggtgcttatc attttaaaga tccacagtac 600 

439 ccagtttggg aattgacaat cagagtttgg tcagagttga atattggaac aggaacctca 660 

440 gcttatactt cactcaatgt tttagctagg tttacagatt tggagttgca tggattaact 720 

441 cctctttcta cacagatgat gagaaatgaa tttagagtta gtactactga aaatgttgta 780 

442 aatttgtcaa attatgaaga tgcaagggca aaaatgtctt ttgctttgga tcaggaagat 840 

443 tggaagtctg atccttccca aggtggtgga attaaaatta ctcatttcac tacctggaca 900 

444 tccattccaa ccttagctgc tcagtttcca ttcaatgctt cagattcagt tgggcaacaa 960 

445 attaaagtta taccagtgga cccatacttt ttccagatga caaacactaa tcctgatcaa 1020 

446 aaatgtataa cagccttggc ctctatttgt cagatgttct gcttttggag gggagatctt 1080 

447 gttttcgatt tccaggtttt tccaaccaaa tatcattcag gtaggctgtt gttttgtttt 1140 

448 gttcctggga atgagttaat agatgttact ggaattacat taaaacaggc aactactgct 1200 
44 9 ccttgtgcag tgatggacat tacaggagtg cagtcaacct tgagatttcg tgttccttgg 1260 

450 atttctgata caccctatcg agtgaatagg tacacgaagt cagcacatca aaaaggtgag 1320 

451 tatactgcca ttgggaagct tattgtgtat tgttataata gattgacttc tccttctaat 1380 

452 gttgcttctc atgttagagt taatgtttat ctttcagcaa ttaatttgga atgttttgct 1440 

453 cctctttacc atgctatgga tgttaccaca caggttggag atgattcagg aggtttctca 1500 

454 acaacagttt ctacagagca gaatgttcct gatccccaag ttggcataac aaccatgagg 1560 

455 gatttaaaag ggaaagccaa taggggaaag atggatgtat caggagtgca ggtacctgtg 1620 

456 ggagctatta caacaattga ggatccagtt ttagcaaaga aagtacctga gacatttcct 1680 

457 gaattgaagc ctggagaatc cagacataca tcagatcaca tgtctattta taaattcatg 1740 

458 ggaaggtctc atttcttgtg tacttttact tttaattcaa acaataaaga gtacacattt 1800 

459 ccaataactc tgtcttcgac ttctaatcct cctcatggtt taccatcaac attaaggtgg 1860 

460 ttctttaatt tgtttcagtt gtatagagga ccattggatt tgacaattat aatcacagga 1920 

461 gccactgatg tggatggtat ggcctggttt actccagtgg gccttgctgt cgacacccct 1980 
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275 
277 
278 
279 
280 
281 
282 
285 
286 
289 
290 

E--> 291 
292 
294 
. 295 
296 
297 
298 
299 
302 
303 
306 
307 

E--> 308 
309 
312 
313 
314 
315 
316 
317 
318 
321 
322 
323 
326 
327 

E--> 328 
329 
332 
333 
334 
335 
336 
337 
338 
341 
342 
343 



<213> 
<220> 
<221> 
<222> 
<223> 



ORGANISM 
FEATURE : 
NAME /KEY 
LOCATION 




<212> 

<213> 
<220> 
<221> 
<222> 
<223> 




primer_bind 
(1) . . (27) 
OTHER INFORMATION: Sequence # 7. 
Sequence of the oligonocleotide # 7 used for 
the amplification of 3A coding sequence by PCR. 
<400> SEQUENCE: 7 
ccatgggaat ttcagatgat gacaatg 
<210> SEQ ID NO: 8 

<211> LENGTH;.^-2^(=5>yj2 

TYPE ^^DN^^^t^— 

ORGANi^TTYChimeric Sequence 
FEATURE: \ 

NAME/KEY: >i?iffier_-bif 
LOCATION: (1) . . (26) 
OTHER INFORMATION: Sequence # 8. 
Sequence of the oligonocleotide # 7 used for 
the amplification of 3 A coding sequence by PCR. 
<400> SEQUENCE: 8 
ggatatcggt tcttcctctt tgcggg 
<210> SEQ ID NO: 9 
<211> LENGT: 
<212> TYPE 

<213> ORGANISM: ^Chimeric 
<220> FEATURE 
<221> NAME/KEY: gene 
<222> LOCATION: (1) . . (85) 
<223> OTHER INFORMATION: Sequence 



27 




26 



# 



9. 

Synthetic fragment coding for 3B protein 
carrying T by C and G by C nucleotide 
substutions, respectively. 
<400> SEQUENCE: 9 

tccagctgtt ggggtttatc atggagtgac taagcccaaa caagtgatta aattggatgc 60 
agatccagta gagtctcagt tgact . , *» 85 

<210> SEQ ID NO: 10 

<211> LENGTH: 89 
TYPEj^^A^^ 
ORGANIS^ 
FEATURE : 
NAME /KEY: gene 
LOCATION: (1) . . (89) 
OTHER INFORMATION: Sequence # 10, 



<212> 

<213> 
<220> 
<221> 
<222> 
<223> 




Synthetic fragment coding for 3B protein 

carrying T by C and G by C nucleotide 
substutions, respectively (complementary chain) . 
<400> SEQUENCE: 10 

ctagagtcaa ctgagactct actggatctg catccaattt aatcacttgt ttgggcttag 60 
tcactccatg ataaacccca acagctgga 89 
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